42 research outputs found

    Approaching Globally Optimal Energy Efficiency in Interference Networks via Machine Learning

    Full text link
    This work presents a machine learning approach to optimize the energy efficiency (EE) in a multi-cell wireless network. This optimization problem is non-convex and its global optimum is difficult to find. In the literature, either simple but suboptimal approaches or optimal methods with high complexity and poor scalability are proposed. In contrast, we propose a machine learning framework to approach the global optimum. While the neural network (NN) training takes moderate time, application with the trained model requires very low computational complexity. In particular, we introduce a novel objective function based on stochastic actions to solve the non-convex optimization problem. Besides, we design a dedicated NN architecture for the multi-cell network optimization problems that is permutation-equivariant. It classifies channels according to their roles in the EE computation. In this way, we encode our domain knowledge into the NN design and shed light into the black box of machine learning. Training and testing results show that the proposed method without supervision and with reasonable computational effort achieves an EE close to the global optimum found by the branch-and-bound algorithm. Hence, the proposed approach balances between computational complexity and performance

    Altruistic Control of Connected Automated Vehicles in Mixed-Autonomy Multi-Lane Highway Traffic

    Get PDF
    We consider the problem of altruistic control of connected automated vehicles (CAVs) on mixed-autonomy multi-lane highways to mitigate moving traffic jams resulting from car-following dynamics of human-driven vehicles (HDVs). In most of the existing studies on CAVs in multi-lane settings, vehicle controller design philosophy is based on a selfish driving strategy that exclusively addresses the ego vehicle objectives. To improve overall traffic smoothness, we propose an altruistic control strategy for CAVs that aims to maximize the driving comfort and traffic efficiency of both the ego vehicle and surrounding HDVs. We formulate the problem of altruistic control under a model predictive control (MPC) framework to optimize acceleration and lane change sequences of CAVs. In order to efficiently solve the resulting non-convex mixed-integer nonlinear programming (MINLP) problem, we decompose it into three non-convex subproblems, each of which can be transformed into a convex quadratic program via penalty based reformulation of the optimal velocity with relative velocity (OVRV) car-following model. Simulation results demonstrate significant improvements in traffic flow via altruistic CAV actions over selfish strategies on both single- and multi-lane roads

    Learning Physical-Layer Communication with Quantized Feedback

    Get PDF
    Data-driven optimization of transmitters and receivers can reveal new modulation and detection schemes and enable physical-layer communication over unknown channels. Previous work has shown that practical implementations of this approach require a feedback signal from the receiver to the transmitter. In this paper, we study the impact of quantized feedback on data-driven learning of physical-layer communication. A novel quantization method is proposed, which exploits the specific properties of the feedback signal and is suitable for nonstationary signal distributions. The method is evaluated for linear and nonlinear channels. Simulation results show that feedback quantization does not appreciably affect the learning process and can lead to similar performance as compared to the case where unquantized feedback is used for training, even with 1-bit quantization. In addition, it is shown that learning is surprisingly robust to noisy feedback where random bit flips are applied to the quantization bits

    Freeway Traffic Jam Mitigation via Connected Automated Vehicles

    Get PDF
    We consider the problem of altruistic control of connected automated vehicles (CAVs) on multi-lane highways to mitigate phantom traffic jams resulting from car-following dynamics of human-driven vehicles (HDVs). In most of the existing studies on CAVs in multi-lane settings, vehicle controller design philosophy is based on a selfish driving strategy that exclusively addresses the ego vehicle objectives. To improve overall traffic smoothness, we propose an altruistic control strategy for CAVs that aims to maximize the driving comfort and traffic efficiency of both the ego vehicle and surrounding HDVs. We formulate the problem of altruistic control under a model predictive control (MPC) framework to optimize acceleration and lane change sequences of CAVs. Simulation results demonstrate significant improvements in traffic flow via altruistic CAV actions over selfish strategies

    Communication Scheduling by Deep Reinforcement Learning for Remote Traffic State Estimation with Bayesian Inference

    Get PDF
    Traffic awareness is the prerequisite of autonomous driving. Given the limitation of on-board sensors (e.g., precision and price), remote measurement from either infrastructure or other vehicles can improve traffic safety. However, the wireless communication carrying the measurement result undergoes fading, noise and interference and has a certain probability of outage. When the communication fails, the vehicle state can only be predicted by Bayesian filtering with a low precision. Higher communication resource utilization (e.g., transmission power) reduces the outage probability and hence results in an improved estimation precision. The power control subject to an estimate variance constraint is a difficult problem due to the complicated mapping from transmit power to vehicle-state estimate variance. In this paper, we develop an estimator consisting of several Kalman filters (KFs) or extended Kalman filters (EKFs) and an interacting multiple model (IMM) to estimate and predict the vehicle state. We propose to apply deep reinforcement learning (DRL) for the transmit power optimization. In particular, we consider an intersection and a lane-changing scenario and apply proximal policy optimization (PPO) and soft actor-critic (SAC) to train the DRL model. Testing results show satisfactory power control strategies confining estimate variances below given threshold. SAC achieves higher performance compared to PPO

    RISnet: A Scalable Approach for Reconfigurable Intelligent Surface Optimization with Partial CSI

    Full text link
    The reconfigurable intelligent surface (RIS) is a promising technology that enables wireless communication systems to achieve improved performance by intelligently manipulating wireless channels. In this paper, we consider the sum-rate maximization problem in a downlink multi-user multi-input-single-output (MISO) channel via space-division multiple access (SDMA). Two major challenges of this problem are the high dimensionality due to the large number of RIS elements and the difficulty to obtain the full channel state information (CSI), which is assumed known in many algorithms proposed in the literature. Instead, we propose a hybrid machine learning approach using the weighted minimum mean squared error (WMMSE) precoder at the base station (BS) and a dedicated neural network (NN) architecture, RISnet, for RIS configuration. The RISnet has a good scalability to optimize 1296 RIS elements and requires partial CSI of only 16 RIS elements as input. We show it achieves a high performance with low requirement for channel estimation for geometric channel models obtained with ray-tracing simulation. The unsupervised learning lets the RISnet find an optimized RIS configuration by itself. Numerical results show that a trained model configures the RIS with low computational effort, considerably outperforms the baselines, and can work with discrete phase shifts

    Pro-social control of connected automated vehicles in mixed-autonomy multi-lane highway traffic

    Get PDF
    We propose pro-social control strategies for connected automated vehicles (CAVs) to mitigate jamming waves in mixed-autonomy multi-lane traffic, resulting from car-following dynamics of human-driven vehicles (HDVs). Different from existing studies, which focus mostly on ego vehicle objectives to control CAVs in an individualistic manner, we devise a pro-social control algorithm. The latter takes into account the objectives (i.e., driving comfort and traffic ef ficiency) of both the ego vehicle and surrounding HDVs to improve smoothness of the entire observable traffic. Under a model predictive control (MPC) framework that uses acceleration and lane change sequences of CAVs as optimization variables, the problem of individualistic, altruistic, and pro-social control is formulated as a non-convex mixed-integer nonlinear program (MINLP) and relaxed to a convex quadratic program through converting the piece-wise-linear constraints due to the optimal velocity with relative velocity (OVRV) car-following model into linear constraints by introducing slack variables. Low-fidelity simulations using the OVRV model and high-fidelity simulations using PTV VISSIM simulator show that pro-social and altruistic control can provide significant performance gains over individualistic driving in terms of efficiency and comfort on both single- and multi-lane roads

    Decentralized Scheduling for Cooperative Localization With Deep Reinforcement Learning

    Get PDF
    Cooperative localization is a promising solution to the vehicular high-accuracy localization problem. Despite its high potential, exhaustive measurement and information exchange between all adjacent vehicles are expensive and impractical for applications with limited resources. Greedy policies or hand-engineering heuristics may not be able to meet the requirement of complicated use cases. In this paper, we formulate a scheduling problem to improve the localization accuracy (measured through the Cram\ue9r-Rao lower bound) of every vehicle up to a given threshold using the minimum number of measurements. The problem is cast as a partially observable Markov decision process and solved using decentralized scheduling algorithms with deep reinforcement learning, which allow vehicles to optimize the scheduling (i.e., the instants to execute measurement and information exchange with each adjacent vehicle) in a distributed manner without a central controlling unit. Simulation results show that the proposed algorithms have a significant advantage over random and greedy policies in terms of both required numbers of measurements to localize all nodes and achievable localization precision with limited numbers of measurements

    Cooperative localization with angular measurements and posterior linearization

    Get PDF
    The application of cooperative localization in vehicular networks is attractive to improve accuracy and coverage of the positioning. Conventional distance measurements between vehicles are limited by the need for synchronization and provide no heading information of the vehicle. To address this, we present a cooperative localization algorithm using posterior linearization belief propagation (PLBP) utilizing angle-of-arrival (AoA)-only measurements. Simulation results show that both directional and positional root mean squared error (RMSE) of vehicles can be decreased significantly and converge to a low value in a few iterations. Furthermore, the influence of parameters for the vehicular network, such as vehicle density, communication radius, prior uncertainty, and AoA measurements noise, is analyzed

    A survey on 5G massive MIMO Localization

    Get PDF
    Massive antenna arrays can be used to meet the requirements of 5G, by exploiting different spatial signatures of users. This same property can also be harnessed to determine the locations of those users. In order to perform massive MIMO localization, refined channel estimation routines and localization methods have been developed. This paper provides a brief overview of this emerging field
    corecore